Structural zeroes and zero-inflated models
نویسندگان
چکیده
SUMMARY In psychosocial and behavioral studies count outcomes recording the frequencies of the occurrence of some health or behavior outcomes (such as the number of unprotected sexual behaviors during a period of time) often contain a preponderance of zeroes because of the presence of 'structural zeroes' that occur when some subjects are not at risk for the behavior of interest. Unlike random zeroes (responses that can be greater than zero, but are zero due to sampling variability), structural zeroes are usually very different, both statistically and clinically. False interpretations of results and study findings may result if differences in the two types of zeroes are ignored. However, in practice, the status of the structural zeroes is often not observed and this latent nature complicates the data analysis. In this article, we focus on one model, the zero-inflated Poisson (ZIP) regression model that is commonly used to address zero-inflated data. We first give a brief overview of the issues of structural zeroes and the ZIP model. We then given an illustration of ZIP with data from a study on HIV-risk sexual behaviors among adolescent girls. Sample codes in SAS and Stata are also included to help perform and explain ZIP analyses.
منابع مشابه
Hurdle, Inflated Poisson and Inflated Negative Binomial Regression Models for Analysis of Count Data with Extra Zeros
In this paper, we propose Hurdle regression models for analysing count responses with extra zeros. A method of estimating maximum likelihood is used to estimate model parameters. The application of the proposed model is presented in insurance dataset. In this example, there are many numbers of claims equal to zero is considered that clarify the application of the model with a zero-inflat...
متن کاملBayesian Zero- Inflated Poisson model for prognosis of demographic factors associated with using crystal meth in Tehran population
Background: Use of methamphetamine (MA) and other stimulants has increased steadily over the past 10 years. Risk factor evaluation to reduce the problem in the community is one solution to protect people from addiction. This study aimed at using Bayesian zero- inflated Poisson (ZIP) model to investigate the relationship between the number of using crystal meth and some demogr...
متن کاملGeneralized estimating equation based zero-inflated models with application to examining the relationship between dental caries and fluoride exposures
GENERALIZED ESTIMATING EQUATION BASED ZERO-INFLATED MODEL WITH APPLICATION TO EXAMINING THE RELATIONSHIP BETWEEN DENTAL CARIES AND FLUORIDE EXPOSURES Sheng Xu April 16, 2013 In the study of dental caries, the number of caries is frequently characterized by over-dispersion and excessive zeros. In addition, the numbers of caries from the same subject are correlated. Zero-Inflated (ZI) regression ...
متن کاملAssessment of length of stay in a general surgical unit using a zero-inflated generalized Poisson regression
Background: The effective use of limited health care resources is of prime importance. Assessing the length of stay (LOS) is especially important in organizing hospital services and health system. This study was conducted to identify predictors of LOS among patients who were admitted to a general surgical unit. Methods: In this cross-sectional study, the sample included all patien...
متن کاملCount Data Models in SAS ®
Poisson regression has been widely used to model count data. However, it is often criticized for its restrictive assumption of equi-dispersion, meaning equality between the variance and the mean. In real-life applications, count data often exhibits over-dispersion and excess zeroes. While Negative binomial regression is able to model count data with over-dispersion, both Hurdle (Mullahy, 1986) ...
متن کامل